Integrating Data Modeling and Dynamic Optimization using Constrained Reinforcement Learning

نویسندگان

  • Naoki Abe
  • Prem Melville
  • Chandan K. Reddy
  • Cezar Pendus
  • David L. Jensen
چکیده

In this paper, we address the problem of tightly integrating data modeling and decision optimization, particularly when the optimization is dynamic and involves a sequence of decisions to be made over time. We propose a novel approach based on the framework of constrained Markov Decision Processes, and establish some basic properties concerning modeling/optimization methods within this formulation. We conduct systematic empirical evaluation of our approach on resource-constrained versions of business optimization problems using two real world data sets. In general, our experimental results exhibit steady convergent behavior of the proposed approach in multiple problem settings. They also demonstrate that the proposed approach compares favorably to alternative methods, which loosely couple data modeling and op-

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Model-Based Reinforcement Learning in Continuous Environments Using Real-Time Constrained Optimization

Reinforcement learning for robot control tasks in continuous environments is a challenging problem due to the dimensionality of the state and action spaces, time and resource costs for learning with a real robot as well as constraints imposed for its safe operation. In this paper we propose a modelbased reinforcement learning approach for continuous environments with constraints. The approach c...

متن کامل

Reinforcement Learning for Fair Dynamic Pricing

Unfair pricing policies have been shown to be one of the most negative perceptions customers can have concerning pricing, and may result in long-term losses for a company. Despite the fact that dynamic pricing models help companies maximize revenue, fairness and equality should be taken into account in order to avoid unfair price differences between groups of customers. This paper shows how to ...

متن کامل

Multi-Agent Evolutionary Game Dynamics and Reinforcement Learning Applied to Online Optimization of Traffic Policy

This chapter demonstrates an application of agent-based selection dynamics to the traffic assignment problem. We introduce an evolutionary dynamic approach that acquires payoff data from multi-agent reinforcement learning to enable a adaptive optimization of traffic assignment, provided that classical theories of traffic user equilibrium pose the problem as one of global optimization. We then s...

متن کامل

Decentralized multi-agent reinforcement learning in average-reward dynamic DCOPs

Researchers have introduced the Dynamic Distributed Constraint Optimization Problem (Dynamic DCOP) formulation to model dynamically changing multi-agent coordination problems, where a dynamic DCOP is a sequence of (static canonical) DCOPs, each partially different from the DCOP preceding it. Existing work typically assumes that the problem in each time step is decoupled from the problems in oth...

متن کامل

Optimization of e-Learning Model Using Fuzzy Genetic Algorithm

E-learning model is examined of three major dimensions. And each dimension has a range of indicators that is effective in optimization and modeling, in many optimization problems in the modeling, target function or constraints may change over time that as a result optimization of these problems can also be changed. If any of these undetermined events be considered in the optimization process, t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008